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In the Claims: 




1. (Currently ^Amended) A system for measuring an ability of a subject, comprising: 
a first set of task items that require the subject to provide one or more spoken responses; 
a speech recognimon system coupled to receive the spoken response and to provide an 

estimate of the spoken response, the speech recognition system having an associated non-random 
inaccuracy; 

a scoring device, th]p scoring device being operable to convert the estimate into an item 
score; and 

a computation device, the computation device providing a subject score based on a 
combination of item scores using a scoring computation model that depends upon an expected 
item-dependent operating characteristic of the speech recognition system, including the 
associated non-random in accuracy of the speech recognition system. 

2. (Original) A systemlas claimed in claim 1, wherein the scoring computation model is 
based on Item Response Theory. 

3. (Original) A system as ckimed in claim 1, wherein the speech recognition system, the 
scoring device and the computation ^evice comprise software modules running on a general 
purpose computing platform. 
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4. (Original) A system as claimed in claim 1 5 wherein the scoring computation model is 
constructed from a plurality of responses provided by a number of native and non-native 

being prompted by a second set of task items. 



speakers, the plurality of responses 



5. (Original) A system as 
speech recognition system comprises 



6. (Original) A system as 
tasks is an item selected from the 



claimed in claim 1, wherein the estimate provided by the 
an estimate of the linguistic content of the spoken response. 



claimed in claim 1 , wherein at least one task in the first set of 
I roup consisting of a prompt to read a sentence aloud, a prompt 
to repeat a word, a prompt to repeat a phrase, to repeat a sentence, a prompt to provide an 
opposite, and a prompt to answer a question. 



7. (Currently Amended) In a computer-based system that grades spoken responses to a 
set of task items, wherein the system comprises a speech recognition system, an improved 
method of grading the spoken responses, the improvement comprising: 

determining a subject score for the spoken responses to the set of task items, wherein the 
subject score accounts for an [ability] item-dependent operating characteristic of the speech 
recognition system to accurately recognise the spoken responses. 

8. (Currently Amended) A method pr measuring an ability of a subject, comprising: 
providing a set of task items; 
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recognize] from responses to the set o 



generating a difficulty value f >r each task item in the set, the difficulty value being based 
upon the task item and a performance measurement associated with an automatic device that 
measures task performance, wherein the performance measurement comprises a measure of [an 
ability of] an item-dependent operatir g characteristic of the automatic device [to accurately 



tasks; 




obtaining a response to each task item from the subject; and 

combining the difficulty values! and the responses from the subject to form a subject score 
reflecting at least one of a linguistic ability and [one] a cognitive ability of the subject. 

9. (Cancelled) A method^as claimed in claim 8, wherein the performance measurement 
is a measure of an ability of the automatte^device to accurately recognize the responses. 

10. (Original) A method as claimed in claim 8, wherein the step of generating a 
difficulty value comprises the step of obtaining a plurality of sample responses from a group of 
sample speakers. 



1 1 . (Original) A method as claimed in claim 10, wherein the step of generating a 
difficulty value further comprises the step of applying a statistical model to the plurality of 
sample responses. 
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12. (Original) A method as claimed in claim 8, wherein the step of combining the 
difficulty values and the responses comprises the step of applying a statistical model to the 
^plurality of responses. 



13. (Original) A method as 
associated with the automatic devic 
recognition system. 




in claim 8, wherein the performance measurement 
upon an operating characteristic of a speech 




14. (Currently Amended) A i^ethod for measuring an ability of a subject comprising: 
providing a set of tasks and a device that automatically measures performance of the 



tasks; 



determining a difficulty value fon each task, wherein the difficulty value is based upon the 
task and upon a performance measure associated with [an ability of the] an item-dependent 
operating characteristic of automated device [to accurately] in assessing performance of the task; 

obtaining verbal responses to the taslcs from the subject; and 

combining the verbal responses and the difficulty values to form a subject score reflecting 
at least one of a linguistic ability and a cognitive ability of the subject. 

15. (Original) A method as claimed in claipi 14, wherein the device comprises an 
automated speech recognition system. 



16. (Currently Amended) An apparatus for determining a difficulty value of items in a 
test, comprising: 
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a set of responses to the items from a number of individuals; 

an automated grader, wherein tl\e automatic grades receives the set of responses and 
provides graded responses; and 

means for reducing the graded responses to a set of item difficulties a said item difficulties 
normalizing the items by accounting for non-random errors by [reflecting an ability of] the 
automatic grader, [to accurately grade the set of responses.] 

17. (Currently Amended) A method fcf determining a difficulty value of items in a text, 
comprising: 

obtaining a set of responses to the items from a number of individuals; 
automatically grading the set of responses) thereby generating graded responses; and 
reducing the graded responses to a set of itebu difficulties, said item difficulties including 
a measurement of [accuracy for] an item-dependent operating characteristic associated with the 
act of automatically grading the set of responses for purposes of normalizing the items to provide 
an accurate assessment. 
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